Seeing Stars: Exploiting Class Relationships for Sentiment Categorization with Respect to Rating Scales
نویسندگان
چکیده
We address the rating-inference problem, wherein rather than simply decide whether a review is “thumbs up” or “thumbs down”, as in previous sentiment analysis work, one must determine an author’s evaluation with respect to a multi-point scale (e.g., one to five “stars”). This task represents an interesting twist on standard multi-class text categorization because there are several different degrees of similarity between class labels; for example, “three stars” is intuitively closer to “four stars” than to “one star”. We first evaluate human performance at the task. Then, we apply a metaalgorithm, based on a metric labeling formulation of the problem, that alters a given -ary classifier’s output in an explicit attempt to ensure that similar items receive similar labels. We show that the meta-algorithm can provide significant improvements over both multi-class and regression versions of SVMs when we employ a novel similarity measure appropriate to the problem. Publication info: Proceedings of the ACL, 2005.
منابع مشابه
Seeing Stars When There Aren’t Many Stars: Graph-Based Semi-Supervised Learning For Sentiment Categorization
We present a graph-based semi-supervised learning algorithm to address the sentiment analysis task of rating inference. Given a set of documents (e.g., movie reviews) and accompanying ratings (e.g., “4 stars”), the task calls for inferring numerical ratings for unlabeled documents based on the perceived sentiment expressed by their text. In particular, we are interested in the situation where l...
متن کاملExploiting Associations between Class Labels in Multi-label Classification
Multi-label classification has many applications in the text categorization, biology and medical diagnosis, in which multiple class labels can be assigned to each training instance simultaneously. As it is often the case that there are relationships between the labels, extracting the existing relationships between the labels and taking advantage of them during the training or prediction phases ...
متن کاملStar Quality: Sentiment Categorization of Restaurant Reviews
We consider the problem of classifying reviews by overall sentiment. Rather than predict a simple positive or negative evaluation on the part of the review’s author, we employ methods to determine the author’s numerical rating on a multi-point scale ranging from one to five stars. We evaluate our approach using restaurant review data from OpenTable.com, a service for finding restaurants and mak...
متن کاملAutomatic Analysis of Document Sentiment
Sentiment analysis, which deals with the computational treatment of opinion, sentiment , and subjectivity in text, has attracted a great deal of attention. Potential applications include question-answering systems that address opinions as opposed to facts and business intelligence systems that analyze user feedback. The research issues raised by such applications are often quite challenging com...
متن کاملSeeing Stars from Reviews by a Semantic-based Approach with MapReduce Implementation
This study concerns the problem of aspect-level opinion (sentiment) mining from online reviews. The problem consists of two fundamental sub-tasks: aspect extraction (identify specific aspects of the product from reviews), and aspect rating estimation (offer a numerical rating for each aspect). Solving this problem is important and useful for many applications, e.g., providing aspect-level revie...
متن کامل